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Abstract: We propose a novel probabilistic method for detection of ob- 
jects in noisy images. The method uses results from percolation and random 
graph theories. We present an algorithm that allows to detect objects of un- 
known shapes in the presence of random noise. Our procedure substantially 
differs from wavelets- based algorithms. The algorithm has linear complex- 
ity and exponential accuracy and is appropriate for real-time systems. We 
prove results on consistency and algorithmic complexity of our procedure. 
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1. Introduction 

In this paper, we propose a new efficient teclinique for quick detection of objects 
in noisy images. Our approach uses mathematical percolation theory. 

Detection of objects in noisy images is the most basic problem of image analy- 
sis. Indeed, when one looks at a noisy image, the first question to ask is whether 
there is any object at all. This is also a prim ary question of interest in such 
diverse fields as, for exa mple, cancer detec tion (|Ricci-Vitiani et al. au- 
tomated urban analysis (INegri et all ( 2006f )). detection of cracks in buried pipes 
( Sinha and Fieeutbl (2006)), and other possible applications in astronomy, elec- 
tron microscopy and neurology. Moreover, if there is just a random noise in the 
picture, it doesn't make sense to run computationally intensive procedures for 
image reconstruction for this particular picture. Surprisingly, the vast majority 
of image analysis methods, both in statistics and in engineering, skip this stage 
and start immediately with image reconstruction. 

The crucial difference of our method is that we do not impose any shape 
or smoothness assumptions on the boundary of the object. This permits the 
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detection of nonsmooth, irregular or disconnected objects in noisy images, under 
very mild assumptions on the object's interior. This is especially suitable, for 
example, if one has to detect a highly irregular non-convex object in a noisy 
image. Although our detection procedure works for regular images as well, it is 
precisely the class of irregular images with unknown shape where our method 
can be very advantageous. 

Many modern methods of object detection, especially the ones that are used 
by practitioners in medical image analysis require to perform at least a prelim- 
inary reconstruction of the image in order for an object to be detected. This 
usually makes such methods difficult for a rigorous analysis of performance and 
for error control. Our approach is fre e from this drawback. Eve n though some 
papers work with a similar setup fsee lArias-Castro et al.l (|2005l) V both our ap- 
proach and our results differ substantially from this and other studies of the 
subject. We also do not use any wavelet-based techniques in the present paper. 

We view the object detection problem as a nonparametric hypothesis testing 
problem within the class of discrete statistical inverse problems. 

In this paper, we propose an algorithmic solution for this nonparametric hy- 
pothesis testing problem. We prove that our algorithm has linear complexity 
in terms of the number of pixels on the screen, and this procedure is not only 
asymptotically consistent, but on top of that has accuracy that grows exponen- 
tially with the "number of pixels" in the object of detection. The algorithm has 
a built-in data-driven stopping rule, so there is no need in human assistance to 
stop the algorithm at an appropriate step. 

In this paper, we assume that the original image is black-and-white and that 
the noisy image is grayscale. While our focusing on grayscale images could have 
been a serious limitation in case of image reconstruction, it essentially does not 
affect the scope of applications in the case of object detection. Indeed, in the 
vast majority of problems, an object that has to be detected either has (on the 
picture under analysis) a color that differs from the background colours (for 
example, in roads detection), or has the same colour but of a very different 
intensity, or at least an object has a relatively thick boundary that differs in 
colour from the background. Moreover, in practical applications one often has 
some prior information about colours of both the object of interest and of the 
background. When this is the case, the method of the present paper is applicable 
after simple rescaling of colour values. 

The paper is organized as follows. In Section [5] we describe some general 
assumptions that one makes in statistical processing of digital images. The sta- 
tistical model itself is described in details in Section [31 Suitable thresholding for 
noisy images is crucial in our method and is developed in Section |4l Our algo- 
rithm for object detection is presented in Section [51 Theorem [His the main result 
about consistency and computational complexity of the algorithm. Section [Bl is 
devoted to the proof of the main theorem. 

2. Basic framework 

In this section we discuss some natural and very basic assumptions that we 
impose on our model. 

Suppose we have an analogous two-dimensional image. For numerical or 
graphical processing of images on computers, the image always has to be dis- 
cretized. This is achieved via a pixelization procedure. 
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A typical example of pixelization procedure is as follows. Consider an x iV 
grid on the square containing the image, and color black those and only those 
pixels for whose the pixel's interior has common points with the original image. 
The result is called a pixelized picture. 

After certain procedure of pixelization, each pixel gets a real number attached 
to it. This assumption means that our only data available are given as an array 
of real numbers {Yij}fj^i- In order to perform statistical image analysis, 
we will use only these A^^ numbers plus our model assumptions. This leads us 
to the following basis assumption. 

(Al) Assume that we have a square screen of N x N pixels, where we 
observe pixelized images. We assume that we are getting our 
information about the image from this screen alone. 

In the present paper we are interested in detection of objects that have a 
known colour. And this colour is different from the colour of the background. 
Mathematically, this can be roughly formulated as follows. 

(A2) The true (non-noisy) images are black-and-white. 

Indeed, we are free to assume that all the pixels that belong to the meaningful 
object within the digitalized image have the value 1 attached to them. We can 
call this value a black colour. Additionally, assume that the value is attached 
to those and only those pixels that do not belong to the object in the true image. 
If the number is attached to the pixel, we call this pixel white. 

In this paper we always assume that we observe a noisy image. The noise 
itself can be caused, for example, by channel defects or other transmission errors, 
distortions, etc. Medical scans provide a classic example of noisy images: the scan 
is always made indirectly, through the body. In astronomy, one also observes 
noisy images: there are optical and technological errors, atmosphere conditions, 
etc. The observed values on pixels could be different from and 1. This means 
that we will actually always have a greyscale image in the beginning of our 
analysis. 

(A3) On each pixel we have random noise that has the distribution function 
F, where F has mean and known variance cr > 0; the noise at 
each pixel is completely independent from noises on other pixels. 

An important special case is when the noise is normally distributed, i.e. F — 
N(0,a), where A(0,f7) stands for the normal distribution with mean and 
known variance cr > 0. However, in general, the noise doesn't need to be smooth, 
symmetric or even continuous. 

Remark 1. We limit ourselves to two-dimensional images only. However, our 
method makes it possible to analyze also n— dimensional images, for each n > 1. 

Remark 2. It doesn't really matter if the screen is square or rectangular. We 
consider a square screen only to simplify our notation. 

Remark 3. It is behind the scope of the present paper to discuss various ways 
to pixelize real-world analo gous images. One possible method of pixelization is 



often used in literature (see lArias-Castro et al.l (j2005l ) and related references) 
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The way of pixelization plays an important role mostly for those problems in 
image analysis where one needs to give asymptotic estimates for the boundary of 
the object. For example, consider pixelizing a plane curve. Then one can consider 
also colouring black those pixels that have at least 4 common points with the 
curve (not necessarily 4 interior points), etc. However, for simpler problems like 
image detection (or crude estimation of object's shape or interior) it is often not 
important how many pixels you colour at the boundary of your image. 

Remark 4. Our method works not only for normal or i.i.d. noise. We have 
chosen this type of noise in order to achieve relatively simple and explicit results. 
The method itself allows the treatment of more complicated situations, such as 
singular noise, discrete noise, pixel colour flipping, different noise in different 
screen areas, and dependent noise. 



3. Statistical Model 



Now we are able to formulate the model more formally. We have aii N x N 
array of observations, i.e. we observe real numbers {Yij}fj^i- Denote the 
true value on the pixel (i, j), ^ i, j < N , by Irriij, and the corresponding noise 
by (J£ij. Therefore, by our model assumptions, 

Yij = Irriij + asij , (1) 
where 1 < i, j < A^, cr > and, in accordance with assumption (A2), 



^ r 1, if belongs to the object; 

\ 0, if (i,j) does not belong to the object. ^ ' 

To stress the dependence on the noise level cr, we write assumption (^3) in the 
following way: 



£y-F, Eey^O, Vare,,=l. (3) 

The noise here doesn't need to be smooth, symmetric or even continuous. More- 
over, all the results below are easily transferred to the even more general case 
when the noise has arbitrary but known distribution function Fgen\ it is not 
necessary that the noise has mean and finite variance. The only adjustment 
to be made is to replace in all the statements quantities of the form F( • /cr) 
by the quantities Fgeni • ). The Algorithm 1 below and the main Theorem [1] are 
valid without any changes for a general noise distribution Fg^n satisfying 
and dll). 

Now we can proceed to preliminary quantitative estimates. If a pixel (i, j) is 
white in the original image, let us denote the corresponding probability distribu- 
tion of Yij by Fq . For a black pixel (i, j) we denote the corresponding distribution 
of Yij by Pi. We are free to omit dependency of Pq and Pi on i and j in our 
notation, since all the noises are independent and identically distributed. 
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Lemma 1. Suppose pixel has white colour in the original image. Then for 
all y G R; 

Po{Y^,>y) = l-F(y^, (4) 

where F is the distribution function of the standardized noise. 
Proof (Lemma [1|): By ©, 



Pq{Y,, >y)^l~ P{ae,j < y) ^ 1 - F 



□ 



Lemma 2. Suppose pixel {i,j) has black colour in the original image. Then for 
all y G K; 

Pi{Y.,<y)^F(^y-^y (5) 

Proof. (Lemma [2): By (jS]) again, we have 

Pi{Y^j <y)^P{l + cre.j < y) = P{ne,j < y - 1) - ^(^^) ■ 

□ 

4. Thresholding and Graphs of Images 

Now we are ready to describe one of the main ingredients of our method: the 
thresholding. The idea of the thresholding is as follows: in the noisy grayscale 
image jf^^x, we pick some pixels that look as if their real colour was black. 
Then we colour all those pixels black, irrespectively of the exact value of grey 
that was observed on them. We take into account the intensity of grey observed 
at those pixels only once, in the beginning of our procedures. The idea is to 
think that some pixel "seems to have a black colour" when it is not very likely 
to obtain the observed grey value when adding a " reasonable" noise to a white 
pixel. 

We colour white all the pixels that weren't coloured black at the previous 
step. At the end of this procedure, we would have a transformed vector of O's 
and I's, call it {^ij}fj=i- We will be able to analyse this transformed picture 
by using certain results from the mathematical theory of percolation. This is 
the main goal of the present paper. But first we have to give more details about 
the thresholding procedure. 

Let us fix, for each N , a real number ao{N) > 0, ao(A^) < 1, such that there 
exists 9{N) G K satisfying the following condition: 

P^{Y,, > e{N)) < ao(iV). (6) 
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Lemma 3. Assume that is satisfied for some 0{N) £ M. Then for the 
smallest possible 0{N) satisfying ^ it holds that 



f(^).l-„.(iV), (7) 

Proof. (Lemma [3]): Obvious by Lemma [TJ □ 

In this paper we will always pick ao{N) = ao for all N ^ N, for some 
constant ao > 0. But we will need to have varying ao( ■) for our future research. 

We are prepared to describe our thresholding princip le formally . Let p ^'*'^ 
be the critical probability for site percolation on fsee iGrimmetll ( 1999h for 
definitions). 

As a first step, we transform the observed noisy image in the 

following way: for all 1 < z, j < TV, 

1. If Yij > 9{N), set Yij := 1 (i.e., in the transformed picture the corre- 
sponding pixel is coloured black). 

2. If Yij < 0{N), set Yij :— (i.e., in the transformed picture the corre- 
sponding pixel is coloured white). 

Definition 1. The above transformation is called thresholding at the level d{N). 
The resulting vector {Yi,j}f^j^i of N'^ values (O's and I's) is called a thresholded 
picture. 

Suppose for a moment that we are given the original black and white image 
without noise. One can think of pixels from the original picture as of vertices 
of a planar graph. Furthermore, let us colour these iV^ vertices with the same 
colours as the corresponding pixels of the original image. We obtain a graph G 
with iV^ black or white vertices and (so far) no edges. 

We add edges to G in the following way. If any two black vertices are neigh- 
bours (i.e. the corresponding pixels have a common side) , we connect these two 
vertices with a black edge. If any two white vertices are neighbours, we connect 
them with a white edge. We will not add any edges between non-neighbouring 
points, and we will not connect vertices of different colours to each other. 

Finally, we see that it is possible to view our black and white pixelized picture 
as a collection of black and white " clusters" on the very specific planar graph 
(a square N x N subset of the lattice). 

Definition 2. We call graph G the graph of the (pure) picture. 

This is a very special planar graph, so there are many efficient algorithms 
to work with black and white components of the graph. Potentially, they could 
be used to efficiently process the picture. However, the above representation of 
the image as a graph is lost when one considers noisy images: because of the 
presence of random noise, we get many gray pixels. So, the above construction 
doesn't make sense anymore. We overcome this obstacle with the help of the 
above thresholding procedure. 

We make 6'(A^)— thresholding of the noisy image {Yij}f^j^i as in Definition 
[U but with a very special value of 0{N). Our goal is to choose 9{N) (and 
corresponding ao{N), see ^) such that: 

imsart-generic ver. 2007/04/13 file: Raiidomlzed_Algoritlims_aiid_P6rcolatlon_Squar6_3.tex date: January 13, 2013 



M. Langovoy and O. Wittich/Detection in noisy images and percolation. 7 



< pf% (8) 



cr 



pT < l~Fl^-^ ), (9) 



where ^^le cri tical probability for site percolation on Z (see iGrimmett 



(|l999l ). lKestCTl (|l982l )'l. In case if both dE]) and (0) are satisfied, what do we get? 



After applying the 0(A^)— thresholding on the noisy picture we 
obtained a (random) black-and-white image Let Gat be the graph 

of this image, as in Definition [2j 

Since Gat is a random, we actually observe the so-called site percolation on 
black vertices within the subset of Z^. From this point, we can use results from 
percolation theory to predict formation of black and white clusters on Gn , as 
well as to estimate the number of clusters and their sizes and shapes. Relations 
dHl) and (O are crucial here. 

To explain this more formally, let us split the set of vertices Vn oi the graph 
Gn into to groups: Vn = V^'^UV^"*, where ^^"nV^"* = 0, and V^^™ consists of 
those and only those vertices that correspond to pixels belonging to the original 
object, while V)^"* is left for the pixels from the background. Denote G™ the 
subgraph of Gat with vertex set V^™ , and denote G^* the subgraph of Gat with 
vertex set F^"*. 

If ([S]) and © are satisfied, we will observe a so-called supercritical percolation 
of black clusters on G™, and a suhcritical percolation of black clusters on G^*. 
Without going into much detai l s on p erc olation t heory (the necessary introduc- 
tion can be found in Grimmett (|l999l) or lKestenI (Il982h 'l. we mention that there 



will be a high probability of forming relatively large black clusters on G™ , but 
there will be only little and scarce black clusters on G^'. The difference between 
the two regions will be striking, and this is the main component in our image 
analysis method. 

In this paper, mathematical percolation theory will be used to derive quanti- 
tative results on behaviour of clusters for both cases. We will apply those results 
to build efficient randomized algorithms that will be able to detect and estimate 
the object {Imi j}^^^^^ using the difference in percolation phases on G™ and 

Gout 
N ■ 

If the noise level a is not too large, then (|S]) and ^ are satisfied for some 
9{N) S (0,1). Indeed, one simply has to pick 9{N) close enough to 1. On the 
other hand, if a is relatively large, it may happen that ([8]) and (|9]) cannot both 
be satisfied at the same time. 

Definition 3. In the framework defined by relations (HJ-© and assumptions 
{Al) - (A3), we say that the noise level a is small enough (or 1-small), if the 
system of inequalities (O and (O is satisfied for some 0{N) £ M, for all G N. 

A very important practical issue is that of choosing an optimal threshold 
value 9. From a purely theoretical point of view, this is not a big issue: once 
dS]) and © holds for some 9, it is guaranteed that after 0— thresholding we will 
observe qualitatively different behaviour of black and white clusters in or outside 
of the true object. We will make use of this in what follows. 

However, for practical computations, especially for moderate values of A, the 
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value of is important. Since the goal is to make percolations on V™ and V^"* 
look as different as possible, one has to make the corresponding percolation 
probabilities for black colour, namely. 



as different as possible both from each other and from the critical probability 
pI^*^^. There can be several reasonable ways for choosing a suitable threshold. 
For example, we can propose to choose 9{N) as a maximizer of the following 
function: 



^ 2 

site 



(10) 



provided that (|S]) and ^ holds. Alternatively, we can propose to use a maximizer 
of 

5. Object detection 

We either observe a blank white screen with accidental noise or there is an 
actual object in the blurred picture. In this section, we propose an algorithm 
to make a decision on which of the two possibilities is true. This algorithm is 
a statistical testing procedure. It is designed to solve the question of testing 
Hq : lij = for all 1 < i, < iV versus Hi : lij — 1 for some i^j. 

Let us choose a{N) G (0, 1) - the probability of false detection of an object. 
More formally, a{N) is the maximal probability that the algorithm finishes its 
work with the decision that there was an object in the picture, while in fact 
there was just noise. In statistical terminology, a{N) is the probability of an 
error of the first kind. 

We allow a to depend on iV; a{N) is connected with complexity (and ex- 
pected working time) of our randomized algorithm. 

Since in our method it is crucial to observe some kind of percolation in the 
picture (at least within the image), the image has to be "not too small" in order 
to be detectable by the algorithm: one can't observe anything percolation-alike 
on just a few pixels. We will use percolation theory to determine how "large" 
precisely the object has to be in order to be detectable. Some size assumption 
has to be present in any detection problem: for example, it is hopeless to detect 
a single point object on a very large screen even in the case of a moderate noise. 

For an easy start, we make the following (way too strong) largeness assump- 
tions about the image: 

(Dl) Assume that the image { [i,j) 1 1 < i,i < A^, lij = 1 } contains a 
completely black square with the side of size at least (pim{N) pixels, where 

(12) 

ims art -generic ver. 2007/04/13 file: Raiidomized_Algorithins_aiid_Percolation_Square_3.tex date: January 13, 2013 



M. Langovoy and O. Wittich/Detection in noisy images and percolation. 9 



/T>o\ r VimjN) 

(D2) lim — -— — oo . 

N^oo logN 

(13) 

Furthermore, we assume the obvious consistency assumption 

^^m{N) <N. (14) 

Assumptions (-D1) and {D2) are sufficient conditions for our algorithm to work. 
They are way too strong for our purposes. It is possible to relax (jl3p and to 
replace a square in (-D1) by a triangle-shaped figure. 

Although the above two conditions are of asymptotic character, most of our 
estimates below are valid for finite N as well. Nevertheless, it is important to 
remark here that asymptotic results for N oo also have interesting practical 
consequences. More specifically, assume that physically we always have screens 
of a fixed size, but the resolution N'^ of our cameras can grow unboundedly. 
When N tends to infinity, we see that the same physical object that has, say, 
1mm in width and in length, contains more and more pixels on the pixelized 
image. Therefore, for high-resolution pictures, our algorithm could detect fine 
structures (like nerves etc.) that are not directly visible by a human eye. 

Now we are ready to formulate our Detection Algorithm. Fix the false detec- 
tion rate a (TV) before running the algorithm. 

Algorithm 1 (Detection). 

• Step 0. Find an optimal 0{N). 

• Step 1. Perform 6* (A^)— thresholding of the noisy picture 

• Step 2. Until 

{{Black cluster of size (pim(N) is found} 

or 

{all black clusters are found}}. 
Run depth- first search ( TarianI ( 1972[ )') on the graph Gn of 



the 6'(A^)-thresholded picture {F»,j}5=i 

Step 3. If a black cluster of size fimiN) was found, report that 
an object was detected 



• Step 4. If no black cluster was larger than ipi,n{N), report that 
there is no object. 

At Step 2 our algorithm finds and stores not only sizes of black clusters, but 
also coordinates of pixels constituting each cluster. We remind that 0{N) is 
defined as in ([5]), Gn and {i^ij}f^j=i were defined in Section [TJ and ipim{N) 
is any function satisfying ([T^ . The depth-first search algorithm is a standard 
procedure used for searching connected components on graphs. This procedure 
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is a deterministic algori thm. The deta iled description and ri gorous cornplexit y 
analysis can be found in lTarian (1972), or in the classic book Aho et aTl (|l975l) . 
Chapter 5. 

Let us prove that Algorithm 1 works, and determine its complexity. 

Theorem 1. Let a be 1-small. Suppose assumptions (Dl) and {D2) are satis- 
fied. Then 

1. Algorithm 1 finishes its work in 0{N'^) steps, i.e. is linear. 

2. there was an object in the picture, Algorithm 1 detects it with probability 
at least (1 — cyi-p[—Ci{a)ifim{N))). 

3. The probability of false detection doesn't exceed m\n{a{N),eiqi{~C2{cr)^Pim{N))} 
for all N > N{a). 

The constants Ci > 0, C2 > and N{a) G N depend only on a . 

Remark 5. Dependence on a implicitly means dependence on 0{N) as well, but 
this doesn't spoil Theorem [T] Remember that we can consider 0{N) to be a 
function of a in view of our comments before ([TU)) and pT|) . 

Theorem [1] means that Algorithm 1 is of quickest possible order: it is linear 
in the input size. It is difficult to think of an algorithm working quicker in this 
problem. Indeed, if the image is very small and located in an unknown place 
on the screen, or if there is no image at all, then any algorithm solving the 
detection problem will have to at least upload information about 0{N'^) pixels, 
i.e. under general assumptions of Theorem [l] any detection algorithm will have 
at least linear complexity. 

Another important point is that Algorithm 1 is not only consistent, but that 
it has exponential rate of accuracy. 



6. Proofs 

This section is devoted to provide complete proofs of the above results. Some 
crucial estimates from percolation theory are also presented for the reader's 
convenience. 

Proof. (Theorem [ij : 

Part I. First we prove the complexity result. Finding a suitable (approximate, 
within a predefined error ) 9 from (|10l) or (1111) takes a constant number of oper- 



ations. See, for example, Krvlov et al.l (|l97 



The 0(Af)— thresholding gives us {^ijlfj^i and Gn in 0{N'^) operations. 
This finishes the analysis of Step 1. 

As for Step 2 , it is known (see, for example, Aho et all ( 1975I ). Chapter 5, 



or iTarianI (Il972l) ') that the standard depth-first search finishes its work also in 



0{N'^) steps. It takes not more than 0{N'^) operations to save positions of all 
pixels in all clusters to the memory, since one has no more than N'^ positions 
and clusters. This completes analysis of Step 2 and shows that Algorithm 1 is 
linear in the size of input data. 
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Part II. Now we prove the bound on the probability of false detection. Denote 



p,^t{N) := , (15) 

a probability of erroneously marking a white pixel outside of the image as black. 
Under assumptions of Theorem [U Pout{N) < pf*-'^. 
We prove the following additional theorem: 

Theorem 2. Suppose that < Pout{N) < p^***^. There exists a constant C3 — 
CjipoutiN)) > such that 



Pp..t{N}{FN{n)) < exp( -uCsipoutiN))) , for all n > ^^„^{N) . (16) 

Here Fn^u) is the event that there is an erroneously marked black cluster of size 
greater or equal n, lying in the square of size N x N corresponding to the screen. 
(An erroneously marked black cluster is a black cluster on Gn such that each of 
the pixels in the cluster was wrongly coloured black after the 9— thresholding.) 

Before proving this result, we state the following theorem about subcritical 
site percolation. 

Theorem 3. (Aizenman- Newman) Consider site percolation with probability p^ 
on 1? . There exists a constant XsUe = ^siteipo) > such that 



Ppoi\C'\>n) < e 



= (po) 



for all n > 1 



(17) 



Here C is the open cluster containing the origin. 



Proof fTheorem[3|): See lBollobas and RiordanI p006f l. 



□ 



To conclude Theorem [21 from Th eor em [51 we w i ll use the celebrated FKG 
inequality (see Fortuin et al. (1971), or Grimmett ( 1999h . Theorem 2.4, p. 34; 
see also Grimmett 's book for some explanation of the terminology). 

Theorem 4. If A and B are both increasing (or both decreasing) events on the 
same measurable pair (fi, T), then P{A n B) > P{A) P{B) . 



Proof. (Theorem 12): Denote by C{i,j) the largest cluster in the N x N screen 
containing the pixel with coordinates {i, j), and by C(0) the largest black cluster 
on the N X N screen containing 0. By Theorem|3l for all i, j: 1 < i,j < N: 

Pp...iN){\Cm>n) < e-"^=-(f-), (18) 
Pp..dN)i\C{z,j)\>n) < e~"^"-(f-). 

Obviously, it only helped to inequalities (jT?]) and that we have limited our 
clusters to only a finite subset instead of the whole lattice Z^. On a side note, 
there is no symmetry anymore between arbitrary points of the N x N finite 
square; luckily, this doesn't affect the present proof. 
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Since { |C(0)| > n} and { |C(i,j)| > n} are increasing events (on the mea- 
surable pair corresponding to the standard random-graph model on Gat), we 
have that { |C(0)| < n} and { \C{i,j)\ < n} are decreasing events for all i, j. 
By FKG inequality for decreasing events, 

Pp.M{\C{t,j)\ <nior ii\\i,j,l<i,j <N) > 

n n Pp.^.iN){\C{t,j)\<n) > (by (HID) 



l<i,j<N 



> (1 



e 



We denote below by C? the "a out of 6" binomial coefhcient. It follows that 



Pp..AN){FN{n)) - PJ,„„,(^^)(3(^,J), 1 < ^,J < TV : |C(z,j)| > n) 



= l-^(-l)'=C^.e-"^"-(f°-)'= 

fc=0 

= ^(-lf-'C^.e-"^"-(''°-)'= 
fe=i 

= iv2e-"^""=(^'°"') + o(A^^e""-^""'(^'°"')) , 

because we assumed in that n > ipi„i{N), and logA^ = o{ipim{N)). More- 
over, we see immediately that Theorem [5] follows now with some C3 such that 

< C^iPoutiN)) < K^te{Pout{N)). □ 

The exponential bound on the probability of false detection follows from 
Theorem [2 

Part III. It remains to prove the lower bound on the probability of true detection. 
First we prove the following theorem: 

Theorem 5. Consider site percolation on lattice with percolation probability 
p > p'2*'^ ■ Let An be the event that there is an open path in the rectangle [0, n] x 
[0, n] joining some vertex on its left side to some vertex on its right side. Let Af„ 
be the maximal number of vertex-disjoint open left-right crossings of the rectangle 
[0,ri] X [0,ri]. Then there exist constants C4 — Ci{p) > 0, C5 = C^lp) > 0, 
Cq — Ce{p) > such that 



Pp(A„) > (19) 

Pp{AU <C5n) < e-^«", (20) 
and both inequalities holds for all n > 1 . 
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Proof. (Theorem [5|): One proves this by a sHght modification of the correspond- 
ing result for bond percolation on th e square lattice. See proof of Lemma 11.22 
and pp. 294-295 in lCrimmettl (|l999l) . 

□ 

Now suppose that we have an object in the picture that satisfies assump- 
tions of Theorem [T] Consider any ipim{N) x ipi^^^N) square in this image. After 
0— thresholding of the picture by Algorithm 1, we observe on the selected square 
site percolation with probability 



Then, by (ITOl) of Theorem [S) there exists C4 = C4{pim{N)) such that there will 
be at least one cluster of size not less than ifim{N) (for example, one could 
take any of the existing left-right crossings as a part of such cluster), provided 
that N is bigger than certain N{pim{N)) — N{a); and all that happens with 
probability at least 

l-ne"^^" > l-e"'^^", 
for some C3: < C3 < C4. Theorem [T] is proved. 

□ 
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